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Abstract. Geometric Langlands duality is usually formulated as a statement 
about Riemann surfaces, but it can be naturally understood as a consequence 
of electric-magnetic duality of four-dimensional gauge theory. This duality in 
turn is naturally understood as a consequence of the existence of a certain 
exotic supersymmetric conformal field theory in six dimensions. The same 
six-dimensional theory also gives a useful framework for understanding some 
recent mathematical results involving a counterpart of geometric Langlands 
duality for complex surfaces. (This article is based on a lecture at the Raoul 
Bott celebration, Montreal, June 2008.) 



1. Introduction 

A d-dimensional quantum field theory (QFT) associates a number, known as 
the partition function Z(Xd), to a closed d- manifold endowed with appropriate 
structure^ Depending on the type of QFT considered, the requisite structure may 
be a smooth structure, a conformal structure, or a Riemannian metric, possibly 
together with an orientation or a spin structure, etc. In physical language, the 
partition function can usually be calculated via a path integral over fields on X. 
However, this lecture will be partly based on an exception to that statement. 

To a closed d — 1-dimcnsional manifold Xd-i (again with some suitable struc- 
ture), a d-dimensional QFT associates a vector space 7i{Xd-x), usually called the 
space of physical states. In the case of a unitary QFT (such as the one associated 
with the Standard Model of particle physics), TC is actually a Hilbert space, not 
just a vector space. The quantum field theories considered in this lecture are not 
necessarily unitary. The partition function associated to the empty <i-manifold is 
Z(0) = 1, and the vector space associated to the empty d— 1-manifold is H(0) = C. 

There is a natural link between these structures. To a d-manifold Xd with 
boundary Xd-i, a d-dimensional QFT associates a vector ipx d € H(Xd-i). (In 
physical terminology, ipx d can usually be computed by performing a path integral 
for fields on Xd that have prescribed behavior along its boundary.) This generalizes 
the partition function, since if Xd-i = 0, then ipx d € TL{0) = C is simply a 
complex number, which is the partition function Z(Xd). 

What I have said so far is essentially rather familiar to physicists. (The reason 
for the word "essentially" in the last sentence is that for most physical applications, 
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a less abstract formulation is adequate.) Less familiar is that it is possible to 
continue the above discussion to lower dimensions. The next step in the hierarchy 
is that to a closed d — 2-manifold Xd-2 (with appropriate structure) one associates 
a category C(Xd-2)- Then, for example, to a d — 1-manifold Xd-i with boundary 
Xd-2, one associates an object V(Xd-i) in the category C(Xd-2). (For relatively 
informal accounts of these matters from different points of view, see [H [2] ; for some 
recent developments, see [3] as well as [4].) 

1.1. Categories And Physics. In practice, physicists do not usually specify 
what should be associated to Xd-2- This is not necessary for most purposes - 
certainly not in standard applications of QFT to particle physics or condensed 
matter physics. However, before getting to the main subject of this talk, I will 
briefly explain a few cases in which that language is or might be useful for physicists. 

So far, the most striking physical application of the "third tier," that is the 
extension of QFT to codimension two, is in string theory, where one uses two- 
dimensional QFT to describe the propagation of a string. In this case, since d = 2, 
ad— 2-manifold is just a point. So the extra layer of structure is just that the 
theory is endowed with a category C, which is the category of what physicists call 
boundary conditions in the quantum field theory, or D-branes. 

For d = 2, a connected d— 1-manifold with boundary is simply a closed interval 
/, whose boundary consists of two points. To define a space TC(I) of physical states 
of the open string, one needs boundary conditions B and B' at the two ends of /. To 
emphasize the dependence on the boundary conditions, the space of physical states 
is better denoted as Tt(I; B, B'). In category language, this space of physical states 
is called the space of morphisms in the category, Homc(B, B 1 ). (This construction 
has two variants that differ by whether the manifolds considered are oriented; they 
are both relevant to string theory.) 

Another case in which the third tier can be usefully invoked, in practice, is 
three-dimensional Chern-Simons gauge theory. This is a quantum field theory for 
d = 3 with a compact gauge group G and a Lagrangian that is, roughly speaking^ 
an integer k times the Chern-Simons functional. A closed d — 2-manifold is now a 
circle, and again, the extra layer of structure is that a category C is associated to 
the theory; it is the category of positive energy representations of the loop group 
of G at level k. 

Finally, the state of the Universe in the presence of a black hole or a cosmological 
horizon is sometimes described in terms of a density matrix rather than an ordinary 
quantum state, to account for one's ignorance of what lies beyond the horizon. This 
point of view (which notably has been advocated by Stephen Hawking) can possibly 
be usefully reformulated or refined in terms of categories. The idea here would be 
that, in d-dimensional spacetime, the horizon of a black hole (or a cosmological 
horizon) is a closed d — 2-manifold. Indeed, suppose that Xd is a (i-dimensional 
Lorentz signature spacetime with an "initial time" hypersurface X^-i- Suppose 
further that a black hole is present; its horizon intersects Xd-\ on a codimension 
two submanifold Xd-2- It is plausible that to Xd-2, we should associate a category 
C, and then to Xd-\ we would associate not - as we would in the absence of the 
black hole - a physical Hilbert space H(Xd-i) - but rather an object V in that 
category. 

2 This formulation suffices if G is simple, connected, and simply-connected. In general, k is 
an element of H 4 (BG, 1). 
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To make this more concrete, suppose for example that C is the category of 
representations of an algebra S. Then V is an 5-module, which in this context 
would mean a Hilbert space Xd-2) with an action of S. Physical operators 

would be operators on this Hilbert space that commute with S. Intuitively S is 
generated by operators that act behind the horizon of the black hole. (That cannot 
be a precise description in quantum gravity, where the position of the horizon can 
fluctuate.) This point of view is most interesting if the algebra S is not of Type 
I, so that it does not have irreducible modules and the category of 5-modules is 
not equivalent to the category of vector spaces. At any rate, even if the categorical 
language is relevant to quantum black holes, it may be oversimplified to suppose 
that C is the category of representations of some algebra. 

1.2. Geometric Langlands. Our aim here, however, is to understand not 
black holes but the geometric Langlands correspondence. In this subject, one stud- 
ies a Riemann surface G, but the basic statements that one makes are about cate- 
gories associated to G. Indeed, the basic statement is that two categories associated 
to G are equivalent to each other. 

For G a simple complex Lie group, let Yc(C) = Hom(7Ti(C), G) be the moduli 
stack of flat G-bundles over C. And let Zq{C) be the moduli stack of holomorphic 
G-bundles over C. 

To the group G, we associate its Langlands [5] or GNO [6] dual group G v . 
(The root lattice of G is the coroot lattice of G v , and vice- versa.) Then the basic 
assertion of the geometric Langlands correspondence [7j is that the category of 
coherent sheaves on Yqv (G) is naturally equivalent to the category of P-modulcs 
on Z G {C). 

If we are going to interpret this statement in the context of quantum field 
theory, we should start with a theory in dimension d — 4, so that it will associate 
a category to a manifold of dimension d — 2 = 2, in this case the two-manifold G. 
We need then an equivalence between a quantum field theory defined using G and 
a quantum field theory defined using G v , both in four dimensions. In fact, there 
is a completely canonical theory with the right properties. It is the maximally 
supersymmctric Yang-Mills theory in four dimensions. 

This theory, which has Af — 4 supersymmetry, depends on the choice of a com- 
paclQ gauge group G. It also depends on the choice of a complex- valued quadratic 
form on the Lie algebra g of G; the imaginary part of this quadratic form is re- 
quired to be positive definite. If G is simple, then Lie theory lets us define a natural 
invariant quadratic form on q (short coroots have length squared 2), and any such 
form is a complex multiple of this one. We write the multiple as 

, . 6 4ni 

(L1) T = Y, + ^ 

where e and 9 (known as the gauge coupling constant and theta-angle) are real. 
We call r the coupling parameter. 

The classic statement (which evolved from early ideas of Montonen and Olive 
[5]) is that JV = 4 super Yang- Mills theory with gauge group G and coupling 



J In the formulation via gauge theory, we begin with a compact gauge group, whose complex- 
ification then naturally appears by the time one makes contact with the usual statements about 
geometric Langlands. Geometric Langlands is usually described in terms of this complexification. 
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parameter t is equivalent to the same theory with dual gauge group G v and coupling 
parameter 

(1.2) r v = -l/n B r. 

(Here n g is the ratio of length squared of long and short roots of G or G v .) The 
equivalence between the two theories exchanges electric and magnetic fields, in a 
suitable sense, and is known as electric-magnetic duality. There are also equiva- 
lences under 

(1.3) r -> r+1, t v — ► t v + 1, 

that can be seen semiclassically (as a reflection of the fact that the instanton number 
of a classical gauge field is integer- valued) . The non-classical equivalence (|1.2|) 
combines with the semiclassical equivalences (|1.3[) to an infinite discrete structure. 
For instance, if G is simply-laced, then n g = 1, G and G v have the same Lie 
algebra, for many purposes one can ignore the distinction between r and t v , and 
the symmetries (jl.2[) and (|1.3[) generate an action of the infinite discrete group 
SX(2,Z) on t. 

There is a "twisting procedure" to construct topological quantum field theories 
(TQFT's) from physical ones. Applied to N = 2 super Yang-Mills theory, this 
procedure leads to Donaldson theory of smooth four-manifolds. Applied to J\f = 4 
super Yang-Mills theory, the twisting procedure leads to three possible construc- 
tions. Two of these are quite similar to Donaldson theory in their content, while 
the third is related to geometric Langlands [9]. 

The equivalence between this third twisting for the two groups G and G v 
(and with an inversion of the coupling parameter) leads precisely at the level of 
"categories," that is for two-manifolds, to the geometric Langlands correspondence. 
(The underlying electric-magnetic duality treats G and G v symmetrically. But the 
twisting depends on a complex parameter; the choice of this parameter breaks the 
symmetry between G and G v . That is why the usual statement of the geometric 
Langlands correspondence treats G and G v asymmetrically.) 

So this is the basic reason that geometric Langlands duality, most commonly 
understood as a statement about Riemann surfaces, arises from a quantum field 
theory in four dimensions. 

Remark 1.1. For another explanation of why four dimensions is a natural 
starting point for geometric Langlands, see |10j . This explanation uses the fact 
that the mathematical theory as usually developed is based on moduli stacks rather 
than moduli spaces; but a two-dimensional sigma model whose target is the moduli 
stack of bundles is best understood as a four-dimensional gauge theory. This relies 
on the gauge theory interpretation of the moduli stack, introduced in a well-known 
paper by Atiyah and Bott [11] , 

2. Defects Of Various Dimension 

In the title of this talk, I promised to get up to six dimensions, not just four. 
Eventually we will, but first we will survey the role of structures of different dimen- 
sion in a four-manifold. 
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Suppose that a quantum field theory on a manifold M is defined by some sort 
of path integral, schematically 




where L is a Lagrangian density that depends on some fields A (and perhaps on 
additional fields that are not written). "Inserting a local operator 0(p) at a point 
p G M" means modifying the path integral at that point. This may be done 
by including a factor in the path integral that depends on the fields and their 
derivatives only at p. It may also be done in some more exotic way, such as by 
prescribing a singularity that the fields should have near p. 

In addition to local operators, we can also consider modifications of the theory 
that are supported on a p-dimensional submanifold N C M. We give some examples 
shortly. A local operator is the case p = 0. The general case we call a p-manifold 
operator. 

In much of physics, the important operators are local operators. This is also 
the case in Donaldson theory. The local operators that are important in Donaldson 
theory are related to characteristic classes of the universal bundle. 

I should point out that geometrically, a local operator may be a tensor field 
of some sort on M; it may be, for example, a g-form for some q. If O q is a local 
operator valued in g-forms, we can integrate it over a q-cycle W q C M to get 
Jw ®i m ^ ne mos t important operators in Donaldson theory are of this kind, with 
q = 2. For our purposes, we need not distinguish a local operator from such an 
integral of one. (What we call a p-manifold operator cannot be expressed as an 
integral of q- manifold operators with q < p.) 

Local operators also play a role in geometric Langlands. Indeed, a construction 
analogous to that of Donaldson is relevant. Imitating the construction of Donaldson 
theory and then applying electric-magnetic duality, one arrives at results, many of 
which arc known in the mathematical literature, comparing group theory of G to 
cohomology of certain orbits in the affine Grassmannian of G v . 

But local operators are not the whole story. In gauge theory, for example, given 
an oriented circle S C M, and a representation R of G, we can form the trace of 
the holonomy of the connection A around S in the given representation. Physicists 
denote this as 

(2.2) W R (S)=Ti R Pex.p(- ^A^. 

When included as a factor in a quantum path integral, W R (S) is known as a Wilson 
operator. Wilson operators were introduced over thirty years ago in formulating a 
criterion for quark confinement in the theory of the strong interactions. 

Wr(S) cannot be expressed as the integral over S of a local operator. We call 
it a one-manifold operator. 

Electric-magnetic duality inevitably converts W R (S) to another one-manifold 
operator, which was described by 't Hooft in the late 1970's. The 't Hooft operator 
is defined by prescribing a singularity that the fields should have along S. (See 
[9j for a review. Operators defined in this way are often called disorder operators, 
while operators like the Wilson operator that are defined by interpreting a clas- 
sical expression in quantum mechanics are called order operators.) The possible 
singularities in G gauge theory are in natural correspondence with representations 
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i? v of the dual group G v . Electric-magnetic duality maps a Wilson operator in 
G v gauge theory associated with a representation i? v to an 't Hooft operator in G 
gauge theory that is also associated with i? v . 

If one specializes to the situation usually studied in the geometric Langlands 
correspondence, the 't Hooft operators correspond to the usual geometric Hecke 
operators of that subject. The electric-magnetic duality between Wilson and 't 
Hooft operators leads to the usual statement that a coherent sheaf on Yqv (C) that 
is supported at a point is dual to a Hecke eigensheaf on Zq(C). (Saying that a 
©-module on Zq(C) is a Hecke eigensheaf is the geometric analog of saying that a 
classical modular form is a Hecke eigenform.) 

Moving up the chain, the next step is a two-manifold operator. In general, in 
rf-dimensional gauge theory, one can define ad— 2-manifold operator as follows. 
One omits from M a codimension two submanifold L. Then, fixing a conjugacy 
class in G, one considers gauge fields on M\L with holonomy around L in the 
prescribed conjugacy class. 

For d = 4, we have d — 2 = 2, so L is a two-manifold. Classical gauge theory 
in the presence of a singularity of this kind has been studied in the context of 
Donaldson theory by Kronheimer and Mrowka. In geometric Langlands, to get a 
class of two-manifold operators that is invariant under electric-magnetic duality, 
one must incorporate certain quantum parameters in addition to the holonomy 
[12] . Once one does this, one gets a natural quantum field theory framework for 
understanding "ramification," i.e. the geometric Langlands analog of ramification 
in number theory. 

The next case are operators supported on a three-manifold W C M . With M 
being of dimension four, W is of codimension one and locally divides M into two 
pieces. The theory of such three-manifold operators is extremely rich and [131 114] 
there are many interesting constructions, even if one requires that they should 
preserve the maximum possible amount of supersymmetry (half of the supersym- 
metry) . 

For example, the gauge group can jump in crossing W . We may have G gauge 
theory one side and H gauge theory on the other. If if is a subgroup of G, a 
construction is possible that is related to what Langlands calls functoriality. Other 
universal constructions of geometric Langlands - including the universal kernel that 
implements the duality - are similarly related to supersymmetric three-manifold 
operators. 

As long as we are in four dimensions, this is the end of the road for modifying 
a theory on a submanifold. A modification in four dimensions would just mean 
studying a different theory. So to continue the lecture, we will, as promised in the 
title, try to relate geometric Langlands to a phenomenon above four dimensions. 



3. Selfdual Gerbe Theory In Six Dimensions 

Until relatively recently, it was believed that four was the maximum dimen- 
sion for nontrivial (nonlinear or non-Gaussian) quantum field theory. One of the 
surprising developments coming from string theory is that nontrivial quantum field 
theories exist up to (at least) six dimensions. 

To set the stage, I will begin by sketching a linear, but subtle, quantum field 
theory in six dimensions. The nonlinear case is discussed in section [4] 
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In six dimensions, with Lorentz signature — h + + H — h, a real three-form H can 
be selfdual, obeying H = *H, where * is the Hodge star operator]! Let us consider 
such an H and endow it with a hyperbolic equation of motion 

(3.1) dH = 0. 

That equation is analogous to the Bianchi identity dF = for the curvature two- 
form F of a line bundle. It means that (in a mathematical language that physicists 
generally do not use) H can be interpreted as the curvature of a £7(1) gerbe with 
connection. 

In contrast to gauge theory, there is no way to derive this system from an 
action. The natural candidate for an action, on a six-manifold Mq, would seem to 
be J M H A *H, but if H is self-dual this is the same as J Mg H A H = 0. 

Nevertheless, there is a quantum field theory of the closed, selfdual H field. To 
explain how one part of the structure of quantum field theory emerges, suppose that 
the Lorentz signature six-manifold Mq admits a global Cauchy hypersurface M5. 
M5 is thus a five-dimensional Riemannian manifold. Fixing the topological type of 
a U{\) gerbe in a neighborhood of M5, the space of gerbe connections with selfdual 
curvature, modulo gauge transformations, is an (infinite-dimensional) symplectic 
manifold in a natural way. (Roughly speaking, if B is the gerbe connection, then 
the symplectic form is defined by the formula ui — f M SB A dSB.) Quantizing 
this space, we get a Hilbert space associated to M 5 . This association of a Hilbcrt 
space to a five-manifold is part of the usual data of a six-dimensional quantum field 
theory. The rest of the structure can also be found, with some effort. (For a little 
more detail, see [15 1 ITBl H7].) 

An important fact is that the quantum field theory of the H field is conformally 
invariant. Classically, the equations H = *H , dH = 0, are conformally invariant. 
The passage to quantum mechanics preserves this property, because the theory is 
linear. 

Now let us consider the special case that our six-manifolc0 takes the form 
Mq = M4 x T 2 , where M4 is a four-manifold and T 2 is a two-torus. We assume a 
product conformal structure on M4 x T 2 . After making a conformal rescaling to 
put the metric on T 2 in a standard form (say a flat metric of unit area), we are left 
with a Riemannian metric on M 4 . The conformal structure of T 2 is determined by 
the choice of a point r in the upper half of the complex plane - modulo the action 
ofSX(2,Z). 

Next in M4 x T 2 , let us keep fixed the second factor, with a definite metric, and 
let the first factor vary. We let M4 be an arbitrary four-manifold with boundaries, 
corners, etc. Starting with a conformal field theory on Mq, this process gives us 
a four-dimensional quantum field theory (not conformally invariant) that depends 
on t as a parameter. Clearly, the induced four-dimensional theory depends on 
the conformal structure of T 2 only up to isomorphism. So if we parametrize the 



The quantum theory of a real selfdual threeform in six dimensions can be analytically 
continued to Euclidean signature, whereupon H is still selfdual but is no longer real. Such a 
continuation will be made later. In general, analytic continuation from Lorentz to Euclidean 
signature and back is an important tool in quantum field theory; the basic reason that it is 
possible is that in Lorentz signature the energy is non-negative. 

^Henceforth, and until section [5.31 we generally work in Euclidean signature, using the ana- 
lytic continuation mentioned in footnote [4] 
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induced four-dimensional theory by t, we will have a symmetry under the action 
of SX(2,Z) on r. 

The induced four-dimensional quantum field theory is actually closely related 
to £7(1) gauge theory, which is its "infrared limit." Let us think of T 2 as C/A, 
where C is the complex plane parametrized by z — x + iy and A is the lattice 
generated by complex numbers 1 and r. Further, make an ansatz 

(3.2) H = F A dx + *F A dy, 

where F is a two- form on M4 (pulled back to Mq — M4 x T 2 ), and * is the four- 
dimensional Hodge star operator. Then the equations dH = become Maxwell's 
equations 



This gives an embedding of four-dimensional £7(1) gauge theory in the six- 
dimensional theory. To be more precise, we should think of H as the curvature of a 
17(1) gerbe connection; then F is the curvature of a £7(1) connection. Of course, we 
have described the embedding classically, but it also works quantum mechanically. 

This construction is more than an embedding of four-dimensional £7(1) gauge 
theory in a six-dimensional theory. The four-dimensional £7(1) gauge theory is the 
infrared limit of the six-dimensional theory in the following sense. We have endowed 
Mq with a product metric g§ that we can write schematically as gg = 54 © g2, 
where 54 and <?2 are metrics on M4 and T 2 , respectively. Now we modify g§ to 
ge(t) — t 2 g^ © 32, where t is a real parameter. The claim is that for t — > 00, the 
theory on Mq converges to £7(1) gauge theory on M4. (This theory is conformally 
invariant, so the t 2 factor in the metric of M4 can be dropped.) This is usually 
described more briefly by saying that £7(1) gauge theory on M4 is the long distance 
or infrared limit of the underlying theory on Mq. 

Even though £7(1) gauge theory on M\ gives an effective and useful description 
of the large t limit of the six-dimensional theory on Me, something is obscured in 
this description. The process of compactifying on T 2 and taking the large t limit 
is canonical in that it depends only on the geometry of T 2 and not on a choice of 
coordinates. But to go to a description by £7(1) gauge theory, we used the ansatz 
(|3.2p . which depended on a choice of coordinates x and y. As a result, some of the 
underlying symmetry is hidden in the description by £7(1) gauge theory. 

Concretely, though the six-dimensional theory does not have a Lagrangian, the 
four-dimensional £7(1) gauge theory does have one: 



The coupling parameter 



of the abelian gauge theory is simply the r-parameter of the T 2 in the underlying 
six-dimensional description. 

The six-dimensional theory depends on t only modulo the usual SL(2, Z) equiv- 
alence r — > (ar + b) / (cr + d) , with integers a, 6, c, d obeying ad — be = 1, since values 
of t that differ by the action of SX(2,Z) correspond to equivalent tori. Therefore, 
the limiting four-dimensional £7(1) gauge theory must also have SL(2, Z) symmetry. 
However, there is no such classical symmetry. Manifest SL(2, Z) symmetry was lost 



(3.3) 



dF = d*F = 0. 



(3.4) 




GEOMETRIC LANGLANDS FROM SIX DIMENSIONS 



9 



in the reduction from six to four dimensions, because the ansatz (|3.2j) . which was 
the key step in reducing to four dimensions, is not SL(2, Z)-invariant. Hence this 
ansatz leads to a four-dimensional theory with a "hidden" SL(2, Z) symmetry, one 
which relates the description by a U(X) gauge field with curvature F to a different 
description by a different U(l) gauge field with another curvature form (which, 
roughly speaking, is related to F by the action of SL(2, 1)). 

What we get this way is an SX(2, Z) symmetry of quantum t/(l) gauge theory 
that does not arise from a symmetry of the classical theory. To physicists, this 
symmetry is known as electric-magnetic duality. The name is motivated by the 
fact that an exchange (x, y) — > (y, — x) in (|3.2p . which is a special case of SL(2, Z), 
would exchange F and *F, and thus in nonrclativistic terminology would exchange 
electric and magnetic fields. 

So we have seen that electric- magnetic duality in t/(l) gauge theory in four 
dimensions follows from the existence of a suitable conformal field theory in six 
dimensions [18] . The starting point in this particularly nice explanation is the ex- 
istence in six dimensions of a quantum theory of a gerbe with selfdual curvature. (It 
is also possible to demonstrate the four-dimensional duality by a direct calculation, 
involving a sort of Fourier transform in field space; see [19] .) 

4. The Nonabelian Case 

Since there is not a good notion classically of a gerbe whose structure group is a 
simple nonabelian Lie group, one might think that it is too optimistic to look for an 
analogous explanation of electric-magnetic duality for nonabelian groups. However, 
it turns out that such an explanation does exist - in the maximally supersymmetric 
case. 

The picture is simplest to describe if G is simply-laced, in which case G and G v 
have the same Lie algebra (and to begin with, we will ignore the difference between 
them, though this is precisely correct only if G = E$; a more complete picture can 
be found in section [4~Tjl . For G to be simply-laced is equivalent to the condition 
that n g = 1 in cqn. (jl.2|) . For many purposes, we can ignore the difference between 
t and t v , and then the quantum duality (|1.2|) and the semiclassical equivalence 
(|1.3|) combine to an action of SX(2,Z) on r. 

For every simply-laced Lie group G, there is a six-dimensional conformal field 
theory that in some sense is associated with gerbes of type G. The theory is highly 
supersymmetric, so supersymmetry is essential in what follows. The existence of 
this theory was discovered in string theory in the mid-1990's. (The first hint |20j 
came by considering Type IIB superstring theory at an ADE singularity.) Its exis- 
tence is probably our best explanation of electric-magnetic duality - and therefore, 
in particular, of geometric Langlands duality. It is, in the jargon of quantum field 
theory, an isolated, non-Gaussian conformal field theory This means among other 
things that it cannot be properly described in terms of classical notions such as 
partial differential equations. 

However, it has two basic properties which in a sense justify thinking of it as 
a quantum theory of nonabelian gerbes. Each property involves a perturbation of 
some kind that causes a simplification to a theory that can be given a classical 
description. The two perturbations are as follows: 

(1) After a perturbation in the vacuum expectation values of certain fields 
(which are analogous to the conjectured Higgs field of particle physics), the theory 
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reduces at low energies to a theory of gerbes, with selfdual curvature, and structure 
group the maximal torus T of G. This notion does make sense classically, since 
T is abelian. In fact, the selfdual gerbe theory of T is much like the U(l) theory 
described in Section 3, with U(l) replaced by T. (Supersymmetry plays a fairly 
minor role in the abelian case.) 

(2) Let Mq = M5 x S 1 be the product of a five-manifold M5 with a circle; we 
endow it with a product metric ge — g$ © g\. The six-dimensional theory on Mq 
has a description (valid at long wavelengths) in terms of G gauge fields (and other 
fields related to them by supersymmetry) on M5, but this description involves a 
highly nonclassical trick. If the circle factor of Mg = Mg x S 1 has radius R, then 
the effective action for the gauge fields in five dimensions is inversely proportional 
to R: 

The factor of multiplying the action is a simple consequence of conformal in- 
variance in six dimensions. (Under multiplication of the metric of Mg by a positive 
constant t 2 , the Hodge operator * mapping two- forms to three- forms in five dimen- 
sions is multiplied by t, while R is also multiplied by t, so the action in (|4.1|) is 
invariant.) Though easily understood, this result is highly nonclassical. Eqn. (|4.1|) 
is a classical Lagrangian for gauge fields in five dimensions. Can it arise from a 
classical Lagrangian for gauge fields on Mq = M5 x S 1 ? Given a six-dimensional 
Lagrangian for gauge fields, we would reduce to a five-dimensional Lagrangian (for 
fields that are pulled back from M5) by integrating over the fibers of the projection 
A/5 x S 1 — * M5. This would give a factor of R multiplying the five-dimensional 
action, not J? -1 . So a theory that leads to the effective action (|4.1|) cannot arise in 
this way. The theory in six dimensions should be, in some sense, not a gauge the- 
ory but a gerbe theory instead, but this does not exist classically in the nonabelian 
case. 

What I have said so far is that the same six-dimensional quantum field theory 
can be simplified to either (i) a six-dimensional theory of abelian gerbes, or (ii) a 
five-dimensional theory with a simple non-abelian gauge group. The two statements 
together show that one cannot do justice to this theory in terms of either gauge 
fields (as opposed to gerbes) or abelian groups (as opposed to non-abelian ones). 

Now let us look more closely at the implications of the peculiar factor of 1 /R in 
(|4.ip . We will study what happens for Mq — M4 x T 2 , the same decomposition that 
we used in studying the abelian gerbe theory in section [3l However, for simplicity 
we will take T 2 = S 1 x S 1 to be the orthogonal product of a circle S 1 of radius R 
and a second circle S 1 of radius S. The tau parameter of such a torus (which is 
made by identifying the sides of a rectangle of height and width 2nR and 2nS) is 

, . S R 

(4.2) t = 1— or r = 1-, 

depending on how one identifies the rectangle with a standard one. The two values 
of r differ by 

(4.3) r-— . 

T 

We first view the six-manifold M 6 as M 6 = M 5 x S 1 , where M 5 = M 4 x S 1 . 
The six-dimensional theory on M 6 reduces at long distances to a supersymmetric 
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gauge theory on M5. According to (|4.1|) . the action for the gauge fields is 



Now if Mi is much larger than S , then at long distances we can assume that the 
fields are invariant under rotation of S 1 and we can deduce an effective action in 
four dimensions by integration over the fiber of the projection M4 x S 1 — > M4. This 
second step is purely classical, so it gives a factor of S. The effective action in four 
dimensions is thus 



The important point is that this formula is not symmetric in S and R, even 
though they enter symmetrically in the starting point M§ = M4 x S 1 x S . Had we 
exchanged the two circles before beginning this procedure, we would have arrived at 
the same formula for the four-dimensional effective action, but with S/R replaced 



Looking back to (|4. 2[) and (|4.3[) . we see that the two formulas differ by r — > 
— 1/t. Thus, we have deducecQthat for simply-laced G, the four-dimensional gauge 
theory that corresponds to the maximally supersymmetric completion of (|4.5j) has a 
quantum symmetry that acts on the coupling parameter by r — > —1/t. This is the 
electric-magnetic duality that has many applications in physics and also underlies 
geometric Langlands duality. What we have gained is a better understanding of 
why it is true in the nonabelian case. 

Remark 4.1. Unfortunately, despite its importance, there is no illuminating 
and widely used name for the six-dimensional QFT whose existence underlies du- 
ality in this way. According to Nahm's theorem [21] . the superconformal sym- 
metry group of a superconformal field theory in six dimensions, when formulated 
in Minkowski spacctime, is OSp(2, 6|2r) for some r. The known examples have 
r = 1 or 2, and the theory with the properties that I have just described is the 
"maximally symmetric" one with r — 2. This theory is rather inelegantly called 
the six-dimensional (0, 2) model of type G, where 2 is the value of r (and the 
redundant-looking number involves a comparison to six-dimensional models that 
are supersymmetric but not conformal). 

Remark 4.2. The bosonic subgroup of OSp(2, 6\2r) is 50(2, 6) x Sp(2r), where 
50(2,6) is the conformal group in six dimensions, and Sp(2r) is an "internal sym- 
metry group" (it acts trivially on spacetime) and is known as the i?-symmctry 
group. Thus, the i?-symmetry group of the (0, 2) model is Sp(A). This is the group 
(sometimes called Sp(2)) of 2 x 2 unitary matrices of quaternions; its fundamental 
representation is of quaternionic dimension 2, complex dimension 4, or real dimen- 
sion 8. Sp(i) is also the group that acts on the cohomology of a hyper-Kahlcr 
manifold; this is no coincidence, as we will see later. 

4.1. The Space Of Conformal Blocks. Among the simple Lie groups, only 
Eg is simply-connected and has a trivial center. Equivalently, its root lattice F 
endowed with the usual quadratic form is unimodular, that is, equal to its dual 



To keep the derivation simple, we considered only a rectangular torus. It is possible by using 
eqn. 1)5. 2} to similarly analyze the case of a general torus. 



(4.4) 




(4.5) 




by R/S. 
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r v . In general, if G is simple, simply-laced, and simply-connected, its center is 
Z = r v /r, and the quadratic form on T leads to a perfect pairing 

(4.6) Z x Z ->■ R/Z = U(l). 

(The pairing actually takes values in the subgroup Z„ of U(l), where n is the 
smallest integer that annihilates Z.) 

What has been said so far is sufficient for E%, but more generally, a refinement 
is necessary. (Most of this article does not depend on the following details.) For 
orientation, consider two-dimensional current algebra (that is, the holomorphic 
part of the WZW modefl) of the simply-connected and simply-laced group G at 
level 1. This theory, formulated on a closed Riemann surface W, does not have a 
unique partition function (which is required in the usual axioms of quantum field 
theory, as indicated in the introduction to this article). Rather, it has a vector 
space of possible partition functions, known as the space of conformal blocks. This 
vector space (in the particular case of a simply-laced group G at level 1) can be 
constructed as follows. The pairing (|4. 6|) together with the intersection pairing on 
the cohomology of W leads to a perfect pairing 

(4.7) H l (W,Z)xH 1 (W,Z)^U{\). 
This pairing enables us to define a Heisenberg group extension 

(4.8) 1 — *■ Z7(l) — » F — > H X (W, Z) — > 0. 

Up to isomorphism, the group F has a single faithful irreducible representation 7Z 
in which U(l) acts in the natural way; it is obtained by "quantizing" the finite 
group H 1 (W, Z). One picks a decomposition of H 1 (W 1 Z) as A x B, where A and 
B (which can be constructed using a system of ^-cycles and -B-cycles on W) are 
maximal subgroups on which the extension 1(4.80 is trivial. One then lets B act 
by multiplication - in the sense that 1Z is the direct sum of all one-dimensional 
characters of B. Since (|4.7p restricts to a perfect pairing Ax B ^ U(l), characters 
of B correspond to elements of A. Thus, 1Z has a unitary basis consisting of elements 
ip a , a € A; the action of A is a(i/v) = ipaa' > while B acts by bip a = exp(27ri(6, a))ip a 
(where exp(27ri(6, a)) denotes the pairing between A and B). The dimension of 1Z 
is thus (#Z) S , where g is the genus of W and #Z is the order of Z. 

The space of conformal blocks of the level 1 holomorphic WZW model on a 
Riemann surface W with a simple and simply-laced symmetry group G is isomorphic 
to 1Z. Thus, for G ^ E$, the space of conformal blocks has dimension bigger than 
1. That means that this theory does not have a distinguished partition function 
and so does not quite obey the full axioms of quantum field theory. One may either 
relax the axioms slightly, study the ordinary (non-holomorphic) WZW model, or 
in some other way include holomorphic or non-holomorphic degrees of freedom so 
as to be able to define a distinguished partition function. 

The situation in the six-dimensional (0, 2) theory is similar, with the finite 
group H 3 (Mq, Z) playing the role of H 1 (W,Z) in two dimensions. From (|4.6|) and 



There is no satisfactory terminology in general use. The WZW model is really |22| a 
two-dimensional quantum field theory that is modular-invariant but neither holomorphic nor an- 
tiholomorphic. Its holomorphic part corresponds to what physicists know as two-dimensional 
current algebra (which is a much older construction than the WZW model). But the phrase 
"two-dimensional current algebra" is not well-known to mathematicians, and may even be unclear 
nowadays to physicists. 
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Poincare duality, we again have a perfect pairing H 3 (M 6 , Z) x H 3 (M 6 , Z) -> U(l), 
leading to a Heisenberg group extension 

(4.9) 1 —> U(l) —> F —> H 3 (M 6 , Z) — > 0. 

Again, up to isomorphism, F has a unique faithful irreducible module T with nat- 
ural action of U{1). The theory on a general six-manifold has a space of conformal 
blocks that is isomorphic to T. For G a simple and simply-laced Lie group that 
is not of type Eg, this again represents a slight departure from the usual axioms 
of quantum field theory. Our options are analogous to what they were in the two- 
dimensional case: live with it (which will be our choice in the present paper) or 
consider various more elaborate constructions in which one can avoid the problem. 

Now let us consider an illuminating example. We take M 6 = M 5 x S 1 . We have 
a decomposition H 3 (M 6 ,Z) = H 2 (M 5 ,Z) H 3 (M 5 ,Z). Calling the summands A 
and B, we can as above construct the space T of conformal blocks as the direct sum 
of characters of B. Hence, as in the two-dimensional case, T has a basis consisting 
of elements i,a£i = H 2 (M 5 , Z). 

On the other hand, the (0, 2) model on Mq = M5 x S 1 is supposed to be 
related to gauge theory on M 5 . So in gauge theory on M 5 , we should find a way to 
define a partition function for every a £ H 2 (M 5 , Z). This is easily done once one 
appreciates that one should use the adjoint form of the group, which we will call 
G^. A Gad bundle over any space X has a characteristic class a £ H 2 (X, Z) (where 
Z is the center of the simply-connected group G or equivalently the fundamental 
group of Gad). In Gad gauge theory on M5, we define for every a £ H 2 (M$, Z) a 
corresponding partition function Z a by summing the path integral of the theory 
over all bundles whose characteristic class equals a. 

In defining the Z a , we are relaxing the usual axioms of quantum field theory a 
little bit. If the gauge group is supposed to be G, the characteristic class must vanish 
and the partition function is essentially Z - (I will omit some elementary factors 
involving the order of Z.) If the gauge group is supposed to be Gad, all values of 
the characteristic class are allowed and the partition function is ^2 a Z a . For groups 
intermediate between G and Gad, certain formulas intermediate between those two 
will arise. But for no choice of gauge group is the partition function precisely Z a , 
for some fixed and nonzero a. Clearly, on the other hand, it is natural to permit 
ourselves to study these functions. So this is a situation in which we probably want 
to be willing to slightly generalize the usual axioms of quantum field theory. 

Now as before let us consider the case M 5 = M4 x S 1 , where S 1 is another circle, 
so that M 6 = M4 x S 1 x S 1 can be viewed in more than one way as the product 
of a circle and a five-manifold. For simplicity, let us assume that H 1 (M4, Z) = 
H 3 (M 4 , Z) = 0. Then H 3 (M 6 , Z) = A® B, where 

(4.10) A = H 2 (M 4 ,Z)<g)H 1 (S 1 ,Z), B = H 2 {M 4 ,Z) ® i? 1 ^ 1 , Z). 

The extension is trivial on both A and B. Reasoning as above, the space T of 
conformal blocks has a basis Vv, a £ A. On the other hand, exchanging the roles of 
A and B, it has a second basis tpf,, b £ B. As is usual in quantization, the relation 
between these two bases (which are analogous to "position space" and "momentum 
space") is given by a Fourier transform. In the present case, both A and B can be 
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identified with H 2 (Mi, Z) and the Fourier transform is a finite sum: 

(4.11) i)i = C exp(27™(a,6))V>a. 

a£H 2 {M 4 ,Z) 

Here C is a constant and we write exp(27ri(a, b)) for the perfect pairing H 2 (M 4 , Z) x 
H 2 {M A ,Z)^U{1). 

Let us interpret this formula in four-dimensional gauge theory. In G^j gauge 
theory on M4, we can as before define a partition function Z a by summing over 
bundles with a fixed characteristic class a <E H 2 (M4, Z). Identifying these with the 
tf> a , we find that under electric-magnetic duality the Z a must transform by 

(4.12) Z b {-l/T) = G^ exp(2wi(b,a))Z a {T). 

a 

We have incorporated the fact that (because it exchanges the last two factors in 
A/6 = M4 x S 1 x S 1 ) electric-magnetic duality inverts r, in addition to its action 
on the label a. This formula was first obtained in purely four-dimensional terms in 
[23j . where more detail can be found. Here we have given a six-dimensional context 
for this result. 

If G is a simply-laced and simply-connected Lie group, then its GNO or Lang- 
lands dual group G v is precisely the adjoint group G^. Apart from elementary 
constant factors that are considered in [23] . the partition function of the theory 
with gauge group G is Zq (since a must vanish if the gauge group is the simply- 
connected form G), and the partition function of the theory with gauge group 
is J2 a Z a (since all choices of a are equally allowed if the gauge group is the ad- 
joint form). As noted in [23] . a special case of (|4.12l) is that Zq transforms under 
t — > — 1/t into a constant multiple of ^2 a Z a - This assertion means that in this 
particular case the G and G v theories are dual. Other specializations of (|4.12j) 
correspond to duality for forms intermediate between G and G^i, but in general 
(|4.12p contains more information than can be extracted from such special cases. 

Remark 4.3. The close analogy between the conformal blocks of the six- 
dimensional (0, 2) model and those of the the level 1 WZW model in two dimensions 
make one wonder if there might be an analog in six dimensions of the WZW models 
at higher level. All one can say here is that the usual (0, 2) model has appeared in 
string theory in many ways and as of yet there is no sign of a hypothetical higher 
level analog. 

4.2. What Is Next? In view of what we have said, if we specialize to six- 
manifolds of the form M§ = M4 x T 2 , where we keep the two-torus T 2 fixed 
and let only M4 vary, the six-dimensional (0,2) theory gives a good framework 
for understanding geometric Langlands. 

We can do other things with this theory, since we are free to consider more 
general six-manifolds. This will be our topic in Section 5. But perhaps we should 
first address the following question. Is this the end? Or will physicists come back 
next year and say that geometric Langlands should be derived from a theory above 
six dimensions? 

There is a precise sense in which six dimensions is the end. It is the maximum 
dimension for superconformal field theory, according to an old result of Nahm [21] . 
To get farther, one needs a different kind of theory. 
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If one wishes to go beyond six dimensions, the next stop is presumably string 
theory (dimension ten). Indeed, the existence and most of the essential properties of 
the six-dimensional QFT that underlies four-dimensional electric-magnetic duality 
are known primarily from the multiple relations this theory has with string theory. 

5. Geometric Langlands Duality For Surfaces 

5.1. Circle Fibrations. As we have discussed, one of the most basic proper- 
ties of the six-dimensional (0, 2) theory is that when formulated on Mq = M$ x S 1 , 
it gives rise at long distances to five-dimensional gauge theory on M5. 

The simplest generalization^ of this is to consider not a product M5 x S 1 , but 
a fibration over M5 with S 1 fibers: 

S 1 -> M e 

(5.1) I 

M 5 . 

(For simplicity, we assume that the fibers are oriented.) In this situation, the long 
distance limit is still gauge theory on M5, with gauge group G. But there is an 
important modification. 

We pick on Mq a Riemannian metric that is invariant under rotation of the 
fibers of the U(l) bundle Mq — > M5. Such a metric determines a connection on this 
U(l) bundle, and therefore a curvature two-form / S fl 2 (M^). Let A be the gauge 
field on M5 (so A is a connection on a G-bundle over M5), and let CS(A) be the 
associated Chern-Simons three-form. (As is customary among physicists, we will 
normalize this form so that its periods take values in M./2-kZ.) Then the twisting of 
the fibration Mq — > M5 results in the presence in the long distance effective action 
of an additional term AI that roughly speaking is 

(5.2) AI =^~f /ACS(A). 

To be more precise, one should define — iAI as the integral of a certain Chern- 
Simons five-form for the group U(X) x G. This Chern-Simons five-form is associ- 
ated to an invariant cubic form on the Lie algebra of U(l) x G that is linear on 
the first factor of this Lie algebra and quadratic on the second. Since AI is i times 
the integral of a Chern-Simons form, AI is well-defined and gauge-invariant mod 
2iriZ assuming that M5 is a compact manifold with boundary. This ensures that 
exp(— AI) is well-defined as a complex number, so that it is possible to include a 
factor of exp(— AI) in the integrand of the path integral of five-dimensional super- 
symmetric gauge theory on M5. (Saying that AI appears as a term in the effective 
action means precisely that the integrand of the path integral has such a factor.) 

5.2. Allowing Singularities. However, it is natural to relax the conditions 
that we have imposed so far. Describing Mq as a U(l) bundle over some base M 5 
amounts to exhibiting a free action of the group U(l) on Mq; if such an action 
is given, one simply defines M5 = Mq/U(\) and then Mq is a U(l) bundle over 
A/5. Clearly, a more general situation is to consider a six-manifold Mq together 
with a non-trivial action of the group U(l). After possibly replacing U(l) by a 



"The material in this section was presented in more detail in lectures 
at the IAS in the spring of 2008. Notes by D. Ben-Zvi can be found at 
http://www.math.utexas.edu/users/benzvi/GRASP/lectures/IASterm.html. 
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finite quotient of itself (to eliminate a possible finite subgroup that acts trivially), 
we can assume that [7(1) acts freely on a dense open set in Me. The quotient 
M5 = Mq/U(1) is a five-manifold possibly with singularities where the [7(1) action 
is non-free. 

The above description, with the term (!5-2[) in the effective action, is applica- 
ble away from the non-free locus in M5 (which consists of the points in M5 that 
correspond to non-free orbits in Mq). Along the non-free locus, one should expect 
the gauge theory description to require some kind of modification. What sort of 
modification is needed depends on how the [7(1) action fails to be free. U(l) may 
have non-free orbits in codimension 2, 4, or 6, and these non-free orbits may be 
either fixed points of the whole group, or semi-free orbits whose stabilizer is a finite 
subgroup of U{1). (To characterize the local behavior, one also needs to specify the 
action of [7(1) in the normal space to the non-free locus. Further, though this will 
not be important for our purposes, in general one wishes to allow the possibility of 
a [7(1) symmetry that acts via a homomorphism to the i?-symmetry group Sp(A), 
in addition to acting geometrically on Mq.) 

Thus, for a full analysis of this problem, there are many interesting cases to 
consider, most of which have not been analyzed yet. A simple example is that 
[7(1) may act on M$ with a fixed point set of codimension 2, in which case M5 is 
a manifold with boundary. Thus a natural boundary condition in five-dimensional 
supersymmetric gauge theory will have to appear. 

For our purposes, we will consider just one situation, in which one knows the 
appropriate modification of the effective field theory that occurs near the excep- 
tional set in M5 . This is the case of a codimension 4 fixed point locus W such that 
the action of [7(1) on the normal space to W can be modeled by the natural action 
of [7(1) on C 2 S R 4 . 

Thus, focussing on the normal space to W, we take [7(1) to act on C 2 by 
(zi, Z2) — ► (e l9 zi, e 10 22)7 for e 10 £ U(l). Clearly, this gives an action of [7(1) on C 2 
that is free except for an isolated fixed point at the origin. Somewhat less obvious 
- but elementary to prove - is that the quotient C 2 /[7(l) is actually a smooth 
manifold. In fact, it is a copy of R 3 : 

(5.3) £ 2 /U(l)^R 3 . 

We can get this statement by taking a cone over the Hopf fibration. The Hopf 
fibration is the 1/(1) bundle S 3 -> S 2 . A cone over S 3 is R 4 = C 2 , while a cone over 
S 2 is R 3 . So, writing for the origin in R 4 or R 3 , R 4 \{0} is a [7(1) bundle over 
R 3 \{0}. Gluing back in the origin on both sides, we arrive at the assertion (|5.3[) . 

It follows from (|5.3p that if U(l) acts on Mq freely except for a codimension 
4 fixed point set W as just described, then M 5 — Af 6 /[7(1) is actually a smooth 
manifold. A few simple facts about the geometry of M5 deserve attention. One 
obvious fact is that W is naturally embedded as a codimension 3 submanifold of M5. 
Moreover, it is only away from W that the natural projection Afg — > M5 = Mq/U(1) 
is a [7(1) fibration. This projection thus gives a [7(1) bundle over M 5 \W, which 
topologically cannot be extended over M5. The obstruction to extending the [7(1) 
bundle can be measured as follows. Let S be a small two-sphere in M^\W that has 
linking number 1 with W. (One can construct a suitable S by choosing a normal 
three-plane N to W at some chosen point p € W and letting S consist of points 
in N a distance e from p, for some small e.) Then the [7(1) bundle over M$\W, 
restricted to S, has first Chern class 1. 
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This fact can be expressed as an equation for the curvature two-form / of the 
U(l) bundle over M§\W . As a form on M^\W, f is closed, obeying df — 0. But 
/ has a singularity along W which can be characterized by the statement 



where 5w is the Poincare dual to W. 

5.2.1. Role Of W In The Quantum Theory. In light of this information, let us 
consider now the (0,2) theory of type G formulated on Mq, and its reduction to 
an effective description on M5. Away from W, as we have already discussed, the 
effective theory on M5 is simply supersymmetric gauge theory with gauge group G, 
and with the additional interaction (|5.2[) that reflects the twisting of the fibration 
Mq — > M5. The gauge held is a connection on a G-bundlc E — > M5. 

However, there is an important and very interesting modification along W . 
This modification results from the fact that the interaction AI is not well-defined 
in the usual sense. We can define a Chern-Simons five-form on M$\W for the 
group U(l) x G, but as M$\W is not compact, the integral of this form is not 
gauge-invariant, even modulo 2n. 

Consequently, exp(— A7), the corresponding factor in the path integral, is not 
well-dehned as a complex number, but as a section of a certain complex line bundle 
C. £ is a line bundle over the space of all G-valued gauge fields, modulo gauge 
transformations, on W. More exactly, £ is a line bundle over the space of all 
connections on E\w modulo gauge transformations (l?|w is simply the restriction 
to W of the G bundle E — > M5). We write A for the space of connections on 
E\\y and Q for the group of gauge transformations; then C is a line bundle over 
the quotient A/Q (or equivalently, a CJ-invariant line bundle over A). In fact, C 
is the fundamental line bundle over A/G, often loosely called the determinant line 
bundle. (The motivation for this terminology is that if G = SU(n) or U(n) for 
some n, then C can be defined as the determinant line bundle of a d operator. It 
can also be defined as the Pfaffian line bundle of a Dirac operator if G = SO{n) or 



The characterization of C can be justified as follows. The interaction AI as 
defined in (|5.2|) does not depend on a choice of gauge for the U(l) bundle M e \W — > 
Ms\W, as it is written explicitly in terms of the ?7(1) curvature /. On the other 
hand, under an infinitesimal G gauge transformation A — > A — d^e, the Chern- 
Simons three-form CS(A) transforms by CS(A) — * CS(A) + dX 2 , where X 2 is 
known to physicists as the anomaly two-form (explicitly, X 2 = (1/4tt) Tr edA). 
Substituting this gauge transformation law in (|5.2|) . integrating by parts, and using 
(|5.4[) , we see that under such a gauge transformation, AI transforms by 



which is equivalent to saying that exp(— AI) should be understood as a section of 
the line bundle £. 

Physicists would describe this situation by saying that the factor exp(— AI) in 
the path integral has an anomaly under gauge transformations that are non-trivial 
along W. The anomaly must be canceled by incorporating in the theory another 
ingredient with an equal and opposite anomaly. This additional ingredient must be 
supported on W (since away from W we already know what is the right effective 
field theory). The theory that does the job is the two-dimensional (holomorphic) 



(5.4) 



df = 2ir5w 



Sp(2n).) 



(5.5) 
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WZW model (or in other words, current algebra, as explained in footnote [7]) on W, 
at level 1. 

This then is the secret of W: it supports this particular two-dimensional quan- 
tum field theory. This is the main fact that we will use in interpreting recent 
mathematical results |24l 1251 126] about instantons and geometric Langlands for 
surfaces. 

5.2.2. More Concrete Argument. This somewhat abstract argument can be re- 
placed by a much more concrete one if G is a group of classical type, rather than an 
exceptional group. (A similar analysis has been made independently for somewhat 
related reasons in (27) . See also |28j . The following discussion requires more de- 
tailed input from string theory than the rest of the present article, and the reader 
may wish to jump to section l5~3l ) The simplest case is that G is SU(n) or even 
better U(n). We use the fact |29j that the (0,2) model of U(n) describes the low 
energy behavior of a system of n parallel M5-branes. We consider M-theory on 
R 7 x TN, where TN is the Taub-NUT space, a certain hyper-Kahler four-manifold 
that topologically is R 4 . (It is described in detail in section l5~4l ) TN has a U(l) 
symmetry with TN/?7(1) = R 3 , as suggested by eqn. (|5.3|k we denote as the point 
in R 3 that corresponds to the U(l) fixed point in TN. Inside R 7 x TN, we consider 
n M5-branes supported on R 2 x TN (for some choice of embedding R 2 C R 7 ); this 
gives a realization of the (0, 2) theory of type U(n) on R 2 x TN. We want to divide 
by the U(l) symmetry of TN to reduce the six-dimensional (0, 2) model supported 
on the M5-branes to a five-dimensional description. This may be done straightfor- 
wardly. For any seven-manifold Qj, M -theory on Qr x TN is equivalent [30] to 
Type IIA superstring theory on Q-j x R 3 with a D6-brane supported on Qr x {0}. 
So Af-theory on R 7 x TN is equivalent to Type IIA on R 7 x R 3 with a D6-brane 
supported at ]R 7 x {0}. In this reduction, the n M5-branes on R 2 x TN turn into 
n D4-branes supported on R 2 x R 3 . The low energy theory on the D4-branes is 
Af = 4 super Yang-Mills theory with gauge group U(n). The D4-branes intersect 
the D6-brane on the Riemann surface W — R 2 x {0}, and a standard calculation 
(which uses the fact that the D4-branes and the D6-brane intersect transversely on 
W) shows the appearance on W of U(n) current algebra at level 1. The behavior 
of the (0, 2) model of type D„ can be analyzed similarly by replacing R 7 in the 
starting point with R 5 /Z 2 x R 2 . 

5.3. Compactification On A Hyper-Kahler Manifold. We are going to 
consider the (0, 2) theory in a very special situation. We take Mq = R x S 1 x X, 
where X will be a hyper-Kahler four-manifold. We think of R as parametrizing 
the "time" direction. On Mg, we take the obvious sort of product metric, giving 
circumference 2ir to S . We could take the metric on Mq to be of Euclidean 
signature (which would agree well with some of our earlier formulas), but it is 
actually more elegant in what follows to use a Lorentz signature metric, that is a 
metric of signature — h + + H — h, with the negative eigenvalue corresponding to the 
R directiono 



^One of the important general facts about quantum field theory, as remarked in footnote [4] is 
that in the world of unitary, physically sensible quantum field theories with positive energy — such 
as the six-dimensional (0, 2) model considered here - it is possible in a natural way to formulate the 
"same" quantum field theory on a space of Euclidean or Lorentzian signature. In the following 
analysis, the main thing that we gain by using Lorentz signature is that the supersymmetry 
generators are hermitian and the energy is bounded below. 
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The most obvious ordinary or bosonic conserved quantities in this situation are 
the ones that act geometrically: the Hamiltonian H, which generates translations 
in the R direction, the momentum P, which generates rotations of S , and possible 
additional conserved quantities associated with symmetries of X . The (0, 2) model 
also has a less obvious bosonic symmetry group; this is the i?-symmetry group 
Sp(4), mentioned in Remark 14.21 Because R x S 1 is flat and X is hyper-Kahler, 
so that Mq = R x S 1 x X admits covariantly constant spinor fields, there are 
also unbroken supersymmetries. In fact, there are eight unbroken supersymmctries 
Q a , a = 1, .. .,8; they are hermitian operators that transform in the fundamental 
representation of the P-symmetry Sp(4) (which has real dimension 8). They com- 
mute with H and P, and obey a Clifford-like algebra. With a suitable choice of 
normalizations and orientations, this algebra reads 

(5.6) {Q a ,Qp} = 2S aP {H-P). 

Accordingly, the operator H — P is positive semi-definite; it can be written 
in many different ways as the square of a Hermitian operator. States that are 
annihilated by H — P are known as BPS states and play a special role in the 
quantum theory [31] . We write V for the space of BPS states. V admits an action 
of U{1) x Sp(4) (or possibly a central extension thereof), where U(l) is the group 
of rotations of S 1 and Sp(A) is the P-symmetry group. The center of Sp(4) is 
generated by an element of order 2 that we denote as (— 1) F ; it acts as +1 or —1 
on bosonic or fermionic states, respectively. So in particular, V is Z2-graded by 
the eigenvalue of (— 1) F . We refer to V, with its action of U(l) x 5p(4), as the 
spectrum of BPS states. One important general fact is that P is bounded below 
as an operator on V; indeed, on general grounds, H is bounded below in the full 
Hilbert space of the (0, 2) theory, while H = P when restricted to V. 

Certain features of the spectrum of BPS states are "topological invariants," 
that is, invariant under continuous deformations of parameters. (In the present 
problem, the relevant parameters are the moduli of the hyper-Kahler metric of 
X.) The most obvious such invariant is the "elliptic genus," F(q) = Try (— l) F q p , 
where q is a complex number with \q\ < 1. (It has modular properties, since it 
can be represented by the partition function of the (0,2) model on £ x X, where 
£ is an elliptic curve whose modular parameter is r = hiq/2iTi.) F(q) is invariant 
under smooth deformation of the spectrum by virtue of the same arguments that 
are usually used to show that the index of a Fredholm operator is invariant under 
deformation. 

In the present problem, the whole spectrum of BPS states, and not only the in- 
dex, is invariant under deformation of the hyper-Kahler metric of X. One approach 
to proving this uses the fact that V can be characterized as the cohomology of Q, 
where Q is any complex linear combination of the hermitian operators Q a that 
squares to zero. Picking any one complex structure on X (from among the complex 
structures that make up the hyper-Kahler structure of X), one makes a judicious 
choice of Q to show that the spectrum of BPS states is invariant under deformations 
of the Kahler metric of X (keeping the chosen complex structure fixed) . Repeated 
moves of this kind (specializing at each stage to a different complex structure and 
therefore a different choice of Q) can bring about arbitrary changes of the hyper- 
Kahler metric of X, so the spectrum of BPS states is independent of the moduli of 
X. 
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In our discussion in section 15. 4\ we compare two computations of V in two 
different regions of the moduli space of hyper-Kahler metrics on X. The results 
must be equivalent in view of what has just been described. 

Remark 5.1. There is some sleight of hand here, as the arguments above have 
assumed X to be compact, and we will use the results for noncompact X . So some 
refinement of the arguments is actually needed. 

5.4. Taub-NUT Spaces. Now the question arises of what sort of hyper- 
Kahler four-manifold X we will select in the above construction. 

We will choose X to admit a triholomorphic U(l) symmetry, that is, a U(l) 
symmetry that preserves the hyper-Kahler structure of X. (Among other things, 
this ensures that this U(l) also commutes with the unbroken supersymmetries Q a 
of eqn. (|5-6[) . ) Hyper-Kahler four-manifolds with triholomorphic U(l) symmetry 
are highly constrained |32) . The general form of the metric is 

(5.7) ds 2 = Udx- dx + ^(d9 + uj • df) 2 , 

where x parametrizes R 3 , U is a harmonic function on R 3 , and (away from singu- 
larities of U) 8 is an angular variable that parametrizes the U(l) orbits. 

This form of the metric shows that the quotient space X/U(l) (assuming X is 
complete) is equal to R 3 . Indeed, the natural projection X — > X/U(l), which was 
considered in section 15. 2i has a special interpretation in this situation. It is the 
hyper-Kahler moment map f± and it is a surjective map to R 3 : 

(5.8) /2 : X -> R 3 . 

The most obvious hyper-Kahler four-manifold with a triholomorphic f (1) sym- 
metry is R 4 . This corresponds to the choice U = l/2\x\. The U(l) action on R 4 
has a fixed point at the origin (where U has a pole and the radius of the U(l) orbits 
vanishes, according to (|5.7p ). This fixed point is precisely of the sort considered 
in section 15.21 To verify this, begin with the fact that the rotation group of R 4 
has SU(2)l x SU(2)r for a double cover; SU(2)l and SU(2)r are two copies of 
SU(2). We can pick a hyper-Kahler structure on X compatible with its flat metric 
such that SU (2) l rotates the three complex structures and SU {2)r preserves them. 
We simply take f7(l) to be a subgroup of SU(2)r. Then, upon picking a complex 
structure on R 4 that is invariant under SU(2)l x U(l) (this complex structure is 
not part of its {/(l)-invariant hyper-Kahler structure), we can identify R 4 with C 2 
and U(l) acts in the natural way [z\, z 2 ) — > (e* 6 ] Z\, e l6 'z 2 ). This then is the situation 
that was considered in section 15.21 and the statement (|5.8p gives a hyper-Kahler 
perspective on the fact that the quotient R 4 /t/(l) is R 3 , as was asserted in (I5.3[) . 

Although R 4 has the properties we need from a topological point of view, there 
is a different hyper-Kahler metric on R 4 that will be more useful for our application 
in section 15.51 This is the Taub-NUT space, which we will call TN. To describe 
TN explicitly, we simply choose U to be 



where R is a constant. Looking at (|5.7j) . the interpretation of R is easy to under- 
stand: the U(l) orbits have circumference 2tt/VU, which at infinity approaches 
2-kR. The flat metric on R 4 is recovered in the limit R — > oo; in R 4 , of course, the 
circumference of an orbit diverges at infinity. 
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Accordingly, the hyper-Kahler metric on TN is quite different at infinity from 
the usual flat hyper-Kahler metric on R 4 . However, in one sense the difference 
is subtle. If we pick any one of the complex structures that make up the hyper- 
Kahler structure, then it can be shown that, as a complex symplectic manifold in 
this complex structure, TN is equivalent to R 4 = C 2 . 

A more general choice of X is also important. First of all, naively we could pick 
an integer k > 1 and take X = R 4 /Zfc, where Z& is the subgroup of U(l) consisting 
of points of order k. Certainly R 4 /Zfc has a (singular) hyper-Kahler metric with 
a triholomorphic U(l) symmetry. The singularity at the origin of R 4 /Zfc is known 
as an A^-i singularity. It is possible to make a hyper-Kahler resolution of this 
singularity, still with a triholomorphic U(l) symmetry. This is accomplished by 
picking k points x\, . . . Xk in R 3 and setting U = \ Ylj=i Vl^ — This gives 
a complete hyper-Kahler manifold which is smooth if the Xj 3X6 distinct. As a 
complex symplectic manifold in one complex structure, it can be described by an 
equation 



where f(w) is a k th order monic polynomial. This is the usual complex resolution 
of the Afc_i singularity. In this description, the holomorphic symplectic form is 
du A dv / f'(w), and the triholomorphic U(l) symmetry is u — > Am, v — > \~ 1 v. 

However, again, a generalization is more convenient for our application in sec- 
tion [3751 We simply add a constant to U and take 



This gives a complete hyper-Kahler manifold, originally constructed in [33. . that 
we call the multi-Taub-NUT space and denote as TN^. 

As a complex symplectic manifold in any one complex structure, TNj is inde- 
pendent of the parameter R and coincides with the usual resolution (|5. 10[) of the 
A/j_i singularity. However, the addition of a constant to U markedly changes the 
behavior of the hyper-Kahler metric at infinity. Just as in the k = 1 case that was 
considered earlier, the asymptotic value at infinity of the circumference of the fibers 
of the fibration TN fc ->■ R 3 is 2nR. 

The space TN^ is smooth as long as the Xj are distinct. When r of them 
coincide, an A r _i singularity develops, that is, an orbifold singularity of type R 4 /Z r . 

In general, for x — > Xj, we have U — > oo. So at those points, and only there, 
the radius of the 1/(1) fibers vanishes. The k points x = Xj are, accordingly, the 
fixed points of the triholomorphic U(l) action. 

5.4.1. A Note On The Second Cohomology. We conclude this subsection with 
some technical remarks that will be useful in section [STBI (but which the reader may 
choose to omit). Topologically, TN^ is, as we have noted, the same as the resolution 
of the Afc_i singularity. A classic result therefore identifies i/ 2 (TNfc,Z) with the 
root lattice of the group Afc_i = SU{k). 

However, TN& is not compact and one should take care with what sort of 
cohomology one wants to use. It turns out that another natural definition is useful. 
We define an abelian group as follows: an element of I\. is a unitary line bundle 
C — > TNfc with anti-selfdual and square-integrable curvature and whose connection 
has trivial holonomy when restricted to a fiber at infinity of jl : TNfc — > R 3 . T is 



(5.10) 



uv = f(w), 



(5.11) 
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a discrete abelian group with a natural and integer-valued quadratic form, defined 
as follows; if £ is a line bundle with anti-selfdual curvature F, we define (£, C) — 
-Jtn^AF/^ 2 . 

It turns out that Tk = Z fc , with the quadratic form corresponding to the qua- 
dratic function of k variables y\ H 1" Vu- (A basis of is described in section 

15.4.21 ) Thus, r corresponds to the weight lattice of the group U(k). In many string 
theory problems involving Taub-NUT spaces, one must use Ffc as a substitute for 
H 2 (TNfc , Z) , which does not properly take into account the behavior at infinity. 

This is notably true if one considers the (0, 2) model on Mq = W x TNfc, for W 
a Riemann surface. In section |4~T| we explained that the (0, 2) model on a compact 
six-manifold M$ has a space of conformal blocks that is obtained by quantizing, 
in a certain sense, the finite abelian group H 3 (Mq, Z). For M@ = W X TNfc, the 
appropriate substitute for this group is 

(5.12) H 3 (W x TN fc ,Z) = H l (W,Z) ®T k . 

5.4.2. Basis OfTk- It is furthermore true that Tk has a natural basis corre- 
sponding to the U(l) fixed points Xj, j — 1, . . . , k. To show this, we first describe 
a dual basis of noncompact two-cycles. For j = 1, . . . , k, we let lj be a path in R 3 
from Xj to oo, not passing through any x r for r ^ j. Then we set Cj = fl~ 1 (£j). Cj 
is a noncompact two-cycle that is topologically R 2 . A line bundle C that represents 
a point in is trivialized at infinity on Cj because its connection is trivial on the 
fibers of \x at infinity. So we can define an integer J c c\(C). One can pick a basis of 

Tk consisting of line bundles C r such that J c c\{C r ) = Sj r . (The C r are described 
explicitly in [36].) 

Now let us reconsider the definition of H 3 (W x TNfc, Z) in (|5.12p . From what 
we have just said, H 1 (W, Z) ® Tk has a natural decomposition as the direct sum of 
copies H^(W, Z) of H 1 (W, Z) associated with the fixed points: 

(5.13) H 3 (W x TN fc ) - ® k j=1 H l {j) {W,Z). 

Upon quantization, this means that the space of conformal blocks of the (0, 2) 
model on W x TNfc is the tensor product of k factors, each of them isomorphic to 
the space of conformal blocks in the level 1 WZW model (associated with the group 
G) on W. The factors are naturally associated to the U(l) fixed points. 

Remark 5.2. Similarly, we can enrich the definition of the two-dimensional 
characteristic class a of a bundle over TNfc. Normally, a takes values in 
if 2 (TNfc, Z). However, suppose E — > TNfc is a Gai bundle that is trivialized over 
each fiber at infinity of fl : TNfc — > R 3 . Then E is trivialized at infinity on each 
Cj, so one can define a pairing aj = (a.Cj) for each j; the cij take values in Z. 
Equivalently, we can consider a as an element of £T 2 (TNfc,Z) — Tk <£> %Z. This 
also has an analog if we are given a conjugacy class C C G?d and the monodromy 
of E on each fiber at infinity lies in C. Then one can define a C-dependent torsor 
for the group H 2 (TNk, Z), and one can regard a as taking values in this torsor. 
Concretely, this means that, once we pick a path in G^j from C to the identity (an 
operation that trivializes the torsor), we can define the elements aj S Z as before. 
Two different paths from C to the identity would differ by a closed loop in Gsd, 
corresponding to an element b G Z; if we change the trivialization of the torsor by 
changing the path by b, then the aj are shifted to aj + b. (b is the same for all j, 
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since the regions at infinity in the two-cycles Cj can be identified, by taking the 
paths £j to coincide at infinity.) 

5.5. Two Ways To Compute The Space Of BPS States. Now we are 

going to study in two different ways the space of BPS states of the (0, 2) model 
formulated on 



The results will automatically be equivalent, as explained at the end of section [5751 
Me admits an action of U(l) x U(l)' (the product of two factors of U(l)), where 
U(l) acts by rotation of S 1 , and U(l)' is the triholomorphic symmetry of TNfc. We 
choose a product metric on Mq, such that S 1 has circumference 2ttS, and TNfc has 
a hyper-Kahler metric in which the U(X)' orbit has asymptotic circumference 2nR. 
In section [5731 we took S = 1; in any event, because the (0, 2) model is conformally 
invariant, only the ratio R/S is relevant. 

U(l) and U(l)' play very different roles in the formalism because of the struc- 
ture of the unbroken supersymmetry algebra, which we repeat for convenience: 



Here P is the generator of the U(l) symmetry. It appears in the definition of 
the elliptic genus F(q) = Try g p (— 1) F , where V is the space of BPS states. The 
function F(q) has modular properties, so if it is nonzero (as will turn out to be the 
case), there are BPS states with arbitrarily large eigenvalues of P. By contrast, it 
turns out that U(l)' acts trivially on V. 

One of our two descriptions of V will be good for S — > or equivalently R — > oo; 
the other description will be good for R — > or equivalently S — > oo. Comparing 
them will give a new perspective on the results of [24\ \25\ I26j . 

5.5.1. Description I. For S — ► 0, the low energy description is by gauge theory 
on M 6 /U(l) = Ix TNfc. As U(l) acts freely, we need not be concerned here with 
the behavior at fixed points. As the metric of M 6 is a simple product S 1 x M 5 
(with M5 = M x TNfc = Mq/U(1)), we also need not worry about the interaction 
described in cqn. (15. 2| . So we simply get maximally supersymmetric Yang-Mills 
theory on K x TNfc, with gauge group G. 

In formulating gauge theory on R x TNfc, we specify up to conjugacy the ho- 
lonomy U of the gauge field over a fiber at infinity of /2 : TNfc — > IR 3 . This choice 
(which has a six-dimensional interpretation) leads to an important bigrading of the 
physical Hilbert space Tt of the theory and in particular of the space V of BPS 
states. First of all, let H be the subgroup of G that commutes with U. Classically, 
one can make a gauge transformation that approaches at infinity a constant ele- 
ment of H; quantum mechanically, to avoid infrared problems, the constant should 
lie in the center of H. So the center of H acts on H and V. We call this the 
electric grading. (The center of H is, of course, abelian, and the eigenvalues of its 
generators are called electric charges.) 

A second "magnetic" grading arises for topological reasons. When U 7^ 1, 
the topological classification of finite energy gauge fields on TNfc becomes more 
elaborate. Near infinity on TNfc, the monodromy around S 1 reduces the structure 
group from G to H, and the bundle can be pulled back from an if-bundle over the 
region near infinity on IR 3 . Infinity on R 3 is homotopic to S 2 , so we get an iJ-bundle 
over S 2 . The Hilbert space of the theory is then graded by the topological type of 



(5.14) 



M 6 = M x S 1 x TNfc. 



(5.15) 



{Q a ,Qf3} = 26 a(3 {H~P). 
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the -ff-bundle. We call this the magnetic grading. (Its components corresponding 
to U(l) subgroups of H are called magnetic charges.) 

According to [6] , electric-magnetic duality exchanges the electric and magnetic 
gradings. In our context, this will mean that the electric grading in Description 
I matches the magnetic grading in Description II, and viceversa. In the simplest 
situation, if U is generic, then H is a maximal torus T of G; the electric and 
magnetic gradings correspond to an action of T and T v , respectively. 

Actually, to extract the maximum amount of information from the theory, 
we want to allow an arbitrarily specified value of the two-dimensional character- 
istic class a. As described in Remark 15. 2i a takes values in a certain torsor for 
H 2 (TNk, Z), which means, modulo a trivialization of the torsor, that a assigns an 
element of Z to each fixed point. (The origin of a in six dimensions was discussed 
in section HTTP Roughly speaking, allowing arbitrary a means that we do G^j gauge 
theory, but there is a small twist: to extract the most information, we divide by 
only those gauge transformations that can be lifted to the simply-connected form 
G. This means that the monodromy U can be regarded as an element of G (up 
to conjugacy), and similarly that in Description II, we meet representations of the 
Kac-Moody group of G (not G^). 

In gauge theory onKx TNfc, U(l)' acts geometrically, generating the triholo- 
morphic symmetry of TN^. But how does U(l) act? The answer to this question is 
that in this description, the generator P of U(l) is equal to the instanton number 
/. (This fact is deduced using string theory.) The instanton number is defined 
via a familiar curvature integral, normalized so that on a compact four-manifold 
and with a simply-connected gauge group, it takes integer values. In the present 
context, the values of the instanton number are not necessarily integers, because 
TNfc is not compact. The analog of integrality in this situation is the following. 
First, one should add to the instanton number / a certain linear combination of the 
magnetic charges (with coefficients given by the logarithms of the monodromies). 
Let us call the sum /. Then there is a fixed real number r, depending only on the 
monodromy at infinity and the characteristic class a, such that I takes values in 
?' + Z. So in this description, eigenvalues of P are not necessary integers, but (for 
bundles with a fixed a and U) a certain linear combination of the eigenvalues of P 
and the magnetic charges are congruent to each other modulo integers. 

Since P' generates the U(l)' symmetry of TN&, one might expect its eigenvalues 
to be integers, but here we run into the electric charges. There is an operator that 
generates the triholomorphic symmetry and whose eigenvalues are integers; it is 
not simply P' but the sum of P' and a central generator of H (this generator is 
the logarithm of the monodromy at infinity), or in other words the sum of P' and 
a linear combination of electric charges. 

What are BPS states in this description? Classically, the minimum energy 
fields of given instanton number are the instantons - that is the gauge fields that are 
independent of time and are anti-selfdual connections on TN^ . Instantons on TN& 
have recently been studied by Z?-brane methods |34L 1351 136] . In particular [34) . 
certain components of the moduli space M. of instantons on TN^, when regarded as 
complex symplectic manifolds in one complex structure, coincide with components 
of the moduli space of instantons on the corresponding ALE space (the resolution 
of R 4 /Zfc). All components of instanton moduli space on the ALE space arise in 
this way, but there are also components of instanton moduli space on TNfc that 
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have no analogs for the ALE space. (According to [34] and as explained to me by 
the author of that paper, these are the components of nonzero magnetic charge, 
corresponding to nonzero electric charge in Description II.) 

An instanton is a classical BPS configuration, but to construct quantum BPS 
states, we must, roughly speaking, take the cohomology of the instanton moduli 
space M.. Actually, M. is not compact and by "cohomology," we mean in this 
context the space of L 2 harmonic forms on M. . (These are relevant for essentially 
the same reasons that they entered in one of the early tests of electric-magnetic 
duality [37] .) So V is the space of L 2 harmonic forms on A4. Of course, to construct 

V we have to include contributions from all components Ai n of A4: 

(5-16) v = e„ff£ 2 haim (M n ), 

where we write H^ 2 harm f° r ^ ne s P ace °f L 2 harmonic forms. The action of P on 

V is multiplication by the instanton number, and similarly the magnetic grading 
is determined by the topological invariants of the bundles parametrized by a given 
Ai n - P' and the electric charges act trivially on V because they correspond to 
continuous symmetries of M. n that act trivially on its cohomology. 

Each M. n is a hyper-Kahler manifold, and accordingly the group Sp{A) - which 
in the present context is the i?-symmetry group (as explained in Remark l4.2p - acts 
on the space of L 2 harmonic forms on M. n and hence on V. However, as in similar 
problems [38] . it seems likely that Sp(A) acts trivially on these spaces. (This is 
equivalent to saying that L 2 harmonic forms exist only in the middle dimension 
and are of type (p,p) for every complex structure.) This would agree with what 
one sees on the other side of the duality, which we consider next. 

Remark 5.3. If we simply replace TNfc by R 4 (with its usual metric) in this 
analysis, we learn in the same way that BPS states of the (0, 2) model on Rx 5* 1 x R 4 
correspond to L 2 harmonic forms on instanton moduli space on R 4 , with its usual 
metric. The same holds with an ALE space instead of R 4 . The advantage of TN& 
over R 4 or an ALE space is that there is an alternative second description. 

5.5.2. Description II. The other option is to take R — > 0. In this case, the 
fibers of : TNfc — > R 3 collapse, so to go over to a gauge theory description, we 
replace TN& by R 3 , with special behavior at the U(l) fixed points Xj, j = 1, . . . , k, 
where holomorphic WZW models will appear. We get a second description, then, 
in terms of maximally supersymmetric gauge theory on M5 = R x S 1 x R 3 , with 
level 1 holomorphic WZW models of type G supported on the k two-manifolds 
W 3 = Rx S 1 x xj, j = l,...,k. 

Once again, we must specify the holonomy U at infinity of the gauge field 
around S . This is simply the same as the corresponding holonomy at infinity in 
Description I. Suppose for a moment that U is trivial. Then we also must pick, for 
each Xj, j = 1, . . . , k, an integrable representation of the affine Kac-Moody algebra 
of G at level 1. For a simply-laced and simply-connected group, the integrable 
representations are classified by characters of the center Z of G, or, as there is 
a perfect pairing Z x Z — > U(l), simply by Z. So for each j, we must give an 
element a,j € Z. This is precisely the data that we obtained in Description I 
from the characteristic class a S iJ 2 (TNfc, Z). Since the second homology group of 
R x S 1 x R 3 vanishes, there is no two-dimensional characteristic class to be chosen 
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in Description II (matching the fact that there was no Kac-Moody representation 
in Description I). 

More generally, for any U, we can canonically pick up to isomorphism a G- 
bundle on R x S 1 x R 3 with that monodromy at infinity, namely a flat bundle with 
holonomy U around S 1 . In the presence of this flat bundle, the Kac-Moody algebra 
on each S* 1 x Xj is twisted; if 9 is an angular parameter on S , then instead of 
the currents obeying J(6» + 2n) = J{0), they obey J (6 + 2vr) = UJ{6)U~ 1 . The 
representations of this twisted Kac-Moody algebra at level 1 are a torsor for Z - 
the same torsor that we met in Remark 1 5. 2 1 The torsor is the same for each j since 
each Kac-Moody algebra is twisted by the same U. (The torsor property means 
concretely that the representations of the Kac-Moody algebra are permuted if U 
undergoes monodromy around a noncontractible loop in G ac i.) 

In Description II, P generates the rotations of S 1 . For reasons that will become 
apparent, what is important is how P acts on the representations of the Kac- 
Moody algebra. In the Kac-Moody algebra, P corresponds to the operator - usually 
called Lq - that generates a rotation of the circle. First set U = 1. Then Lq has 
integer eigenvalues in the vacuum representation of the Kac-Moody algebra (that 
is, the representation whose highest weight is G-invariant) . In a more general 
representation (but still at U = 1), Lq has eigenvalues that are congruent mod 
Z to a fixed constant r that depends only on the highest weight. This matches 
the fact that, in Description I (at U = 1) the instanton number takes values in 
r + Z where r depends only on the characteristic class a. In the Kac-Moody theory, 
when the twisting parameter U is varied away from 1, the eigenvalues of Lq shift. 
However (recalling that H is the commutant of U in G) , one can add to Lq a linear 
combination of the generators of H to make an operator Lq with the property that 
in a given representation of the twisted Kac-Moody algebra, its eigenvalues are 
congruent mod Z. Thus, electric charges play precisely the role in Description II 
that magnetic charges play in Description I. 

On the other hand, in Description II, P' is the instanton number of a G-bundlc 
on the initial value surface S 1 x M 3 . If the monodromy U at infinity is trivial, then 
P' is integer-valued, just as in Description I. In general, for any U , a certain linear 
combination of P' and the magnetic charges (with coefficients given as usual by the 
logarithms of the monodromies) takes integer values. This mirrors the fact that 
in Description I, a linear combination of P' and the electric charges takes integer 
values. 

What is the space of BPS states in Description II? Supported on R x S 1 x Xj 
for each j — l,...,k, there is a level 1 Kac-Moody module Wj. This module 
has H — P for all states (mathematically, the representation theory of affine Kac- 
Moody algebras is usually developed with a single Lq operator, not two of them), 
and consists entirely of BPS states. The space of BPS states is simply V = ®j? = iVVj. 
In particular, as the i?-symmetry group Sp(4) acts trivially on the Wj, it acts 
trivially on V. The analogous statement in Description I was explained at the end 
of section 15.5.11 

Comparing the results of the two descriptions, we learn that 



(5.17) 



®* =1 Wj = ® n Hl- 2 harm (M n ). 
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The right hand side is graded by instanton number and magnetic charge, and the 
left hand side by Lq and electric charge. This equivalence closely parallels a central 
claim in [2411251126] . 

A couple of differences may be worthy of note. Our description uses L 2 har- 
monic forms; different versions of cohomology are used in recent mathematical 
papers. Also, our instantons live on TN& with its hyper-Kahler metric, not on the 
resolution of the Afc_i singularity. This does not affect M n as a complex symplectic 
manifold (as long as one considers on the TN& side judiciously chosen components 
of the moduli space [34]), but it certainly affects the hyper-Kahler metric of M. n 
and therefore the condition for an L 2 harmonic form. The components M. n of in- 
stanton moduli space on TN^ that do not have analogs on the resolution of the 
Afc_i singularity are also presumably important. 

One may wonder why we do not get additional BPS states from quantizing the 
moduli space of instantons, as we did in Description I. This can be understood as 
follows. 

Generically, curvature breaks all supersymmetry. In Description I, because the 
curvature of TN& is anti-selfdual, it leaves unbroken half of the supersymmetry. The 
half that survives is precisely the supersymmetry that is preserved by an instanton 
(since an instanton bundle also has anti-selfdual curvature). Hence instantons are 
supersymmetric and must be considered in constructing the space of BPS states. By 
contrast, in Description II, there is no curvature to break supersymmetry. Instead, 
there is a coupling (|5.2|) . which (when extended to include fields and terms that we 
have omitted) leaves unbroken half the supersymmetry, but a different half from 
what is left unbroken by anti-selfdual curvature. The result is that in Description 
II, instantons are not supersymmetric. 

So in Description II, the instanton number and similarly the magnetic charges 
annihilate any BPS state. This implies that P' and the magnetic charges annihilate 
V in Description II, just as P' and the electric charges do in Description I. 

5.5.3. A Note On The Dual Group. The reader may be puzzled by the fact 
that in this analysis of two ways to describe the space of BPS states, we have not 
mentioned the dual group G v . The reason for this is that for simplicity, we have 
limited ourselves to the case that G is simply-laced. When this is so, G and G v 
have the same Lie algebra. Instead of merely comparing G and G v theories, we 
can learn more, as explained in section 14. 1[ by considering G^i bundles with an 
arbitrary two-dimensional characteristic class a. This is what we have done. 

For groups that are not simply-laced, a slight variant of the construction that 
we have used is available [39] . The basic idea is that outer automorphisms of a 
simply-laced group G can appear as symmetries of the (0, 2) model of type G in 
six dimensions. To a simple but not simply-laced Lie group H, one associates a 
pair (G, A), where G is simply-laced and A is an outer automorphism of G. The 
(0,2) model of type G, when formulated on Mq = M$ x S , can be "twisted" 
by A in going around S , in which case the low energy limit on M$ is maximally 
supersymmetric gauge theory of type H y . Now consider Mq — M4 x S 1 x S 1 , with 
circles of radius S and R and a twist by A around S . Repeating the analysis of 
section 2] we get at long distances on M4 xS'a description by H v gauge theory; 
this further reduces on M4 to a description by H v gauge theory with coupling 
parameter r v = iS/R. Alternatively, we get a description by G gauge theory on 
AI4 x S 1 with a twist around S 1 that reduces G to H; this further reduces on M4 
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to H gauge theory with coupling parameter r = iR/S. The comparison of these 
two descriptions is essentially the perspective offered in [39] on electric-magnetic 
duality in four dimensions for non-simply-laced groups. 

We can learn more if we do not reduce all the way to four dimensions. We take 
Mg = R x S 1 x TNfc, where S 1 has circumference 2ttS and the fiber at infinity of 
ft : TNfc — > R 3 has circumference 2irR. For small S, we get Description I, which 
involves H v instantons on TN^. For small R, we get Description II, which now 
involves level 1 modules of the affine Kac-Moody algebra of G, twisted by A. This 
is equivalent to the claim of [24] (although the roles of H and H y are reversed in 
our presentation here). To make contact with the formulation given in [24] . one 
must know that the Langlands or GNO dual of the Kac-Moody group of H v is not 
the Kac-Moody group of H but the A-twisted Kac-Moody group of G. 

Remark 5.4. The construction in the last paragraph does not appear to have 
an analog with a twist around the circle at infinity in TN^. Precisely because this 
circle is contractible in the interior of TN& , it is not possible to twist by a discrete 
symmetry of the (0, 2) model in going around this circle. 

5.5.4. Points With Multiplicity. So far we have considered the points xj to 
be distinct, so that TNfe is smooth. It is important, however, to consider the 
behavior as some of the Xj become coincident. In general, suppose that the Xj 
become coincident for j = ii, . . . , i r . Without any essential change in the following 
remarks, we could allow several subsets of Xj to simultaneously become coincident. 

In Description I, when this happens, TN^ develops a A r _i singularity which is 
an orbifold singularity, locally modeled by ]R 4 /Z r . Gauge theory on R 4 /Z r is defined 
as Z r -invariant gauge theory on R 4 , but the notion of Z r invariance depends on the 
choice of how Z r acts on the fiber of a G-bundle at the fixed point (the origin in 
R 4 ). Such a choice is a homomorphism <p : Z r — > G. When a Z r orbifold singularity 
develops, the space V of BPS states becomes graded by the choice of <j>. This is in 
addition to the grading by the part of the characteristic class a that can be defined 
on the complement of the singularity. 

The dual in Description II is that r of the points Xj that support level 1 holo- 
morphic WZW models become coincident, say at a point y € R 3 . Then the sub- 
manifold R x S 1 x y of Mq supports a level r holomorphic WZW model of type G. 
The affine Kac-Moody group of G supports several inequivalent integrable highest 
weight modules of level r, and when the points Xi 1 , . . . , Xi r become coincident, the 
Hilbert space of the theory decomposes as a direct sum of subspaces transforming 
in different such representations. This also gives a decomposition of the space V of 
BPS states. 

So duality must establish a correspondence between two types of data (0 and 
the relevant part of a on one side; a choice of level r integrable representation on 
the other side). Such a correspondence is used in [24] and can be described as 
follows in physical terms. For simplicity, we set G = Eg, so that we can dispense 
with a. Consider three-dimensional Chern-Simons theory with gauge group G at 
level r. A Wilson loop can be considered in any irreducible representation R that is 
the highest weight of a level r integrable module of the affine Kac-Moody algebra. 
On the other hand, around the Wilson loop, the gauge field acquires a monodromy 
that is an element of G of order r. This gives a correspondence between level r 
integrable modules and conjugacy classes of order r. It should be possible to use 
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string theory arguments to show that this is the correspondence that enters in 
comparing Descriptions I and II, but this will not be attempted here. 

The general story is to consider points y\ , . . . , y s € R 3 with multiplicities 
ri, ...,r a . In our presentation, we obtain this case starting with k — X)i=i r i 
points, all of multiplicity 1 , and letting them coalesce in clumps of the appropriate 
sizes. Then in Description I, we consider a TN^ space that is constrained to have 
singularities of type A n _i for i = 1, . . . , s. Gauge theory at the i th singularity is 
defined by a choice of homomorphisms <fii : Z n — * G. Description II is based on 
supersymmetric gauge theory oi K x S 1 xl 3 , with each K x S 1 x yi supporting a 
holomorphic level WZW model, associated with the level integrable represen- 
tation that corresponds to (pi. 

This correspondence, moreover, is compatible with further coalescences of points, 
in close parallel to the picture of [24] . Any further coalescence of points leads to a 
further decomposition of V on both sides. 

Acknowledgments I thank A. Braverman for explanations of the ideas of |24j . I 
also thank S. Cherkis for careful comments on the manuscript and G. Segal for a 
discussion of the sense in which the (0, 2) theory does not have a Lagrangian. 
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